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Interleukin-1 receptor-associated kinases (IRAKs) are Ser/Thr protein kinases that play an important role as 
signaling mediators in the signal transduction facilitated by the Toll-like receptor (TLR) and interleukin-1 
receptor families. Among IRAK family members, IRAK4 is one of the drug targets for diseases related to the 
TLR and IL-1R signaling pathways. Experimental evidence suggests that the IRAK4 kinase domain is 
phosphorylated in its activation loop at T342, T345, and S346 in the fully activated state. However, the 
molecular interactions of subdomains within the active and inactive IRAK4 kinase domain are poorly 
understood. Hence, we employed a long-range molecular dynamics (MD) simulation to compare apo 
IRAK4 kinase domains (phosphorylated and unphosphorylated) and ATP-bound phosphorylated IRAK4 
kinase domains. The MD results strongly suggested that lobe uncoupling occurs in apo unphosphorylated 
IRAK4 kinase via the disruption of the R334/T345 and R310/T345 interaction. In addition, apo 
unphosphorylated trajectory result in high mobility, particularly in the N lobe, activation segment, helix aG, 
and its adjoining loops. The Asp-Phe-Gly (DFG) and His-Arg-Asp (HRD) conserved kinase motif analysis 
showed the importance of these motifs in IRAK4 kinase activation. This study provides important 
information on the structural dynamics of IRAK4 kinase, which will aid in inhibitor development. 

Toll-like receptors (TLRs) respond to pathogen-associated molecular patterns (PAMPs) and initiate the first 
wave of inflammatory signals and innate immune responses 1,2 . Interleukin-1 receptor (IL-1R) family cyto- 
kine receptors originate and regulate inflammatory and immune responses. The uncontrolled TLR/IL-1R 
signaling pathways, which involve interleukin-1 receptor- associated kinase 4 (IRAK4), lead to multiple diverse 
diseases ranging from chronic to autoinflammatory disorders 3,4 . Therefore, the upstream proteins involved in the 
activation of IRAK4 and signaling can be potential therapeutic drug targets. Several attempts have been made to 
modulate the activity of the drug target, and some of the drugs that were designed for these targets are now in 
clinical trials 4,5 . More than a few novel classes of inhibitors have been developed recently for the inhibition of 
IRAK4 activation. However, these inhibitors are still in preclinical trials 6 . 

IRAKs (Ser/Thr kinases) are essential mediators in TLR and IL-1R signal transduction 7,8 . Upon stimulation, 
TLRs and IL-IRs recruit the adaptor proteins and thereby form an intracellular receptor-adaptor complex 9,10 . 
This receptor-adaptor complex in turn recruits IRAKs 11 . Upon IRAK recruitment, IRAKI undergoes phosphor- 
ylation by IRAK4 on key Ser and Thr residues. IRAKI then detaches from the receptor complex and associates 
with tumor necrosis factor receptor-associated factor 6 (TRAF6); then, it binds with the TGF-(3-activated kinase 
1 - TAK1 -binding protein 1 -TAK1 -binding protein 2 (TAK1 -TAB1 -TAB2) complex, leading to the stimulation of 
nuclear factor kB (NF-kB), p38, and c-Jun N-terminal protein kinase mitogen-activated protein kinases (JNK 
MAPKs) 12,13 . Several studies have shown the involvement of IRAK4 in TLR and IL-1R signaling, particularly in 
Myd88-dependent TLR signaling 14 . Severe defects have been observed in cytokine responses in mice lacking 
IRAK4, and impaired responses have been seen in human IRAK4-deficient cells 1416 . These studies indicate that, 
despite the absolute involvement of IRAK4 in TLR signaling, inhibition of IRAK4 activation may cause too broad 
an impact on TLR/IL-1R signaling pathways. Deficiency of IRAK4 in children makes them susceptible to 
streptococcal pyogenic infection, but no severe viral or parasitic infection has been observed 17 . In IRAK4-deficient 
patients approaching adolescence, susceptibility to infections, particularly chronic infections, decreases 18,19 . 

IRAK4 may induce signals via intermolecular interactions and its kinase activity, resulting in activation of the 
downstream kinase IRAKI 14 . Experimental studies have reported that some pathways, such as NF-KB and JNK, 
require IRAK4 kinase function. However, cells expressing only inactive mutants of IRAK4 show reduced response 
in NF-kB activation compared to the wild type 20 . Studies on IRAK4 kinase-dead mutant knock-in mice from 
different strains also support the significance of IRAK4 kinase activity 2124 . Another report has also indicated that 
disruption of the formation of the receptor complex with a truncated kinase domain of IRAK4 inhibits IL-1 
signaling 17 . TLRs, such as TLR7 and TLR9, activate IRFs, which induce type I interferon (IFN). IRAK4-dependent 
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induction of IFNs has been observed for TLR7 and TLR9 signaling 
upon stimulation with TLR ligands 25 . The amelioration of disease 
symptoms observed in lupus-prone mice upon dual inhibition of 
TLR7 and TLR9 suggests that IRAK4 may be a potential therapeutic 
drug target for systemic lupus erythematous (SLE) 26,27 . A recent study 
has also reported that IRAK4 and IRAKI are associated with Vogt- 
Koyanagi-Harada (VKH) disease 28 . Therefore, therapeutic benefits 
can be achieved by modulating the function of IRAK4 via kinase 
inhibition for toning down the inflammatory responses in TLR/IL- 
1R signaling. 

The mammalian IRAK family comprises four members: IRAKI 29 , 
IRAK2 30 , IRA KM 31 and IRAK4 14 . AU IRAK family members show 
similar domain structure with an N-terminal death domain and a 
central kinase domain (KD). IRAK4 shares high similarity in the KD 
with all other family members 8 . The crystal structure of IRAK4 KD 
reveals that it is composed of two lobes: an N-terminal lobe com- 
prising mainly antiparallel P-strands and an a-helix (helix ocC); and a 
C-terminal lobe exclusively comprising a-helices (Figure 1). The 
hinge sequence connecting N and C lobes partially describes the 
adenosine triphosphate (ATP) and the ATP-competitive kinase inhi- 
bitors 32 . The N lobe contains the glycine-rich loop (GXGXcpG) 
formed by the first two P-strands. The activation segment (329- 
358) located in the C lobe contains important catalytic residues such 
as the Asp-Phe-Gly (DFG) (329-33 1 ) motif. The central region of the 
activation segment is the activation loop; autophosphorylation of 
this activation loop at T342, T345, and S346 is required for IRAK4 
kinase activity, and mutation at these positions causes significant 
reduction in the catalytic activity of IRAK4 kinase 32-34 . Out of these 
three phosphorylation sites, T345 is the only prototypical phosphor- 
esidue required for the activity of IRAK4 kinase, which is similar to 
many protein kinases 35,36 . In IRAK4, pT345 exhibits direct inter- 
action with R334 and with the S346 backbone amine. In addition, 
phosphorylation at T342 (pT342) is coordinated by T365, K367, and 
K441; phosphorylation at S346 (pS346) is highly exposed to the 
solvent, without any intramolecular interactions 32,34 . Structural ana- 




Figure 1 | Structure of IRAK4 kinase domain. The structure of IRAK4 KD 
predominantly shows the N lobe with P-strands and an a-helix, C lobe with 
exclusively a-helices. The structural parts were shown in various colors, N- 
terminal extension preceding kinase domain in red, G loop in orange, 
activation segment in magenta, hinge region in green, aDE loop in cyan, 
helix aG and its adjoining loops in blue. ATP molecule between the N and 
C lobe were shown in ball and stick model in slate gray color. 
Phosphorylated residues pT342, pT345, and pS346 are highlighted. The 
structural parts were labeled according to the crystal structure 32 . 



lysis of inactive kinases has revealed that the unphosphorylated 
activation segment can adopt different conformations 37,38 . The His- 
Arg-Asp (HRD) (309-311) motif is important in the catalytic loop 
and Arg from the HRD motif is important for the kinases regulated 
by the activation loop 35 . The apo-active crystal structure of IRAK4 
KD has been found to have two distinct conformations for helix aC 
in the N lobe: helix aC-in and helix aC-out. Although the helix aC-in 
is the active conformation, the usual salt bridge between Lys (K213) 
and Glu (E233) is absent, in contrast to the finding for other 
kinases 39,40 . Most importantly, IRAK4 has a Tyr (Y262) as the gate- 
keeper residue, which forms a hydrogen bond with E233 in the active 
form of the kinase; thus, the back pocket is not accessible to ligands. 
The other IRAK family members IRAKI, IRAK2 and IRAKM also 
have Tyr as a gatekeeper residue 32 . 

Over the past few years, molecular dynamics (MD) simulation has 
been extensively employed as a computational approach for investi- 
gating the connecting pathway between the active and inactive states 
of kinase. For example, the transition from the inactivation state to 
the activation state of the Src tyrosine kinase catalytic domain was 
studied by the string method with swarms of trajectories. This study 
suggested that activation occurs via the activation loop first, followed 
by helix ocC rotation 41 . A metadynamics sampling approach used to 
study the mechanism underlying the transition from the activation to 
inactivation state of cyclin-dependent kinase 5 (CDK5) suggests a 
two-step mechanism, that is, helix aC rotates by —45°, which is 
followed by activation loop refolding 42 . CDK2 was studied by 
Groban et at using molecular mechanics to reproduce the conforma- 
tional changes that take place on CDK2 phosphorylation. This study 
suggests that changes occur in the energy landscape because of phos- 
phorylation 43 . Hematopoietic cell kinase (HCK) was studied using 
targeted MD simulations to investigate the transition from the inact- 
ive to the active conformation. The results of this study suggest the 
structural effects due to phosphorylation of the activation loop are 
sequentially transmitted to the helix aC and the hinge region 44 . PrkC 
kinase from Bacillus subtilis were studied using MD simulations to 
examine the conformational changes caused by phosphorylation and 
ATP binding 45 . In addition, catalytically inactive EGFR conforma- 
tions of an active EGFR kinase were recently observed 46 . 

Since there is very little experimental evidence available for IRAK4 
kinase, the regulation of the mechanism of activation is not clearly 
understood. However, in comparison with other kinases, regulation 
of IRAK4 kinase may depend on activation loop phosphorylation 
and switching between helix aC-in and helix aC-out conforma- 
tions 34 . Therefore, the activation and inactivation cycle of IRAK4 
must be a multi-component process with many levels of regulation. 
To date, no crystal coordinates are available for inactive IRAK4 
kinase. Recently, IRAK4 kinase has attracted attention as a thera- 
peutic drug target in TLR signaling 6 . Hence, it is necessary to under- 
stand the inactive IRAK4 kinase to unravel the regulation 
mechanism of IRAK4 kinase. In this study, we have applied compu- 
tational approaches to compare apo IRAK4 KDs (phosphorylated 
and unphosphorylated) and ATP-bound phosphorylated IRAK4 
KDs. For simplicity, hereafter we designate unphosphorylated apo 
IRAK4 KD as uIRAK4, phosphorylated apo IRAK4 KD as pIRAK4, 
and ATP-bound phosphorylated IRAK4 KD as pIRAK4-ATP when- 
ever necessary. Most importantly, we carried out 200-ns long-range 
MD simulation for all the three forms of IRAK4 KDs to understand 
the conformational changes in uIRAK4 and pIRAK4, and also exam- 
ined the impact of the natural ligand ATP on the structural changes 
in IRAK4 KD. In addition, principal component analysis (PCA) was 
performed to understand the collective motions within uIRAK4, 
pIRAK4, and pIRAK4-ATP. 

Results 

Summary of the IRAK4 KD systems. To compare and understand 
the conformational changes of apo and ATP-bound forms of IRAK4 
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Table 1 | Summary of the trajectories subjected to molecular dynamics simulations 

Trajectory Phosphorylation state Ligand Components 

ulRAK4 Unphosphorylated Apo ulRAK4 in water 

plRAK4 pT342, pT345, and pS346 Apo plRAK4 in water 

plRAK4-ATP pT342, pT345, and pS346 ATP and 2 Mg 2+ ions plRAK4, ATP, and 2 Mg 2+ ions in water 



KD, the structural models were constructed for apo and ATP-bound 
IRAK4 KDs, as detailed in Table 1. The structural models uIRAK4, 
pIRAK4, and pIRAK4-ATP were subjected to dynamics simulations 
for the production run of 200 ns (refer to the methods section). 

Conformational changes in the activation segment. It is well 
known that phosphorylation of the activation loop is required for 
the activation of many kinases 47 and that the activation loop 
undergoes conformational changes upon switching between 
inactive and active states of the kinase. The number and location 
of phosphorylation sites within the activation loop varies between 
kinases 48 . T342, T345, and S346 are the phosphorylation sites in the 
activation loop of IRAK4 KD 33 . We were interested in understanding 
the structural changes in the activation segment (329-358) of the 
IRAK4 systems due to the phosphorylation of the activation loop. To 
investigate this, we used the SuperPose server to superimpose the 
activation segments from uIRAK4, pIRAK4, and pIRAK4-ATP 
obtained from the MD simulation (Figure 2A). The RMSD values 
of the activation segments of uIRAK4 and pIRAK4, uIRAK4 and 
pIRAK4-ATP, and pIRAK4 and pIRAK4-ATP were 4.78 A, 
5.22 A, and 4.15 A, respectively. In particular, the RMSD between 
pIRAK4 and pIRAK4- ATP was lower than the values of uIRAK4 and 
pIRAK4 and of uIRAK4 and pIRAK4-ATP. These RMSD values 
indicate a large structural variation in the activation segment upon 
activation loop phosphorylation and ATP binding to the 
phosphorylated IRAK4 kinase in order to stabilize the activation 
segment. To further elucidate the structural flexibility in the 
activation segment, we superimposed ten snapshots from 200-ns 
MD run (Supplementary Information Figure SI). From these 
results, we observed that the activation segment remained 
relatively stable in pIRAK4-ATP as expected. In contrast, both 
phosphorylated and unphosphorylated structures were unstable. 
The conformational flexibility in the pIRAK4 suggests that 
phosphorylation in the activation loop induces conformational 
changes in the activation segment. In addition, deviations in the 
activation segments upon removal of phosphate groups (uIRAK4) 



indicated that the initial inactivation dynamics of IRAK4 KD may 
impose some conformational changes on the activation segment. 

To obtain more in-depth understanding of the structural plasticity 
in the activation segment, we focused on the G331 from the DFG 
(329-33 1 ) motif and T345 in the activation loop, as these residues are 
important for kinase activity. The DFG motif is highly conserved 
in eukaryotic Ser/Thr kinase and is situated at the base of the activa- 
tion loop. In the activation state of the kinase, ATP forms 
polar interactions with DFG, either directly or indirectly, via mag- 
nesium ions. Distortions in the DFG motif have also been observed 
in some kinases due to the structural changes that originate from 
the activation loop phosphorylation 49 . The prototypical residue 
pT345 in the activation loop coordinates with the basic amino 
acids important for kinase activity. G331 from DFG exhibits vertical 
and horizontal motions in both pIRAK4 and uIRAK4. In the 
pIRAK4-ATP model, the G331 position is highly stable. This sug- 
gests that activation loop phosphorylation, or unphosphorylation 
causes the distortions in the DFG loop. The position of T345 in 
the pIRAK4-ATP model differed from that in the pIRAK4 and 
uIRAK4 models (Supplementary Information Figure SI). In 
pIRAK4 and uIRAK4, the position of the T345 showed horizontal 
and vertical motion, respectively, whereas in the pIRAK4-ATP 
model, the position of pT345 was stable with vertical motion 
(Supplementary Information Figure SI). In particular, in pIRAK4 
and pIRAK4-ATP, the phosphate group of pT345 was located close 
to the basic amino acids necessary for the stability of its activation 
segment, thereby altering the shape of the activation loop. In con- 
trast, T345 in the activation loop of uIRAK4 was located away from 
the basic amino acids. 

The RMSD plot for the residues of activation segment (Figure 2B) 
showed that uIRAK4 has larger deviations compared with pIRAK4 
and pIRAK4-ATP. It indicated that the phosphorylation of the 
activation loop and ATP binding to IRAK4 kinase may stabilize 
and lead to conformational changes in the activation segment, and 
thus may activate the kinase and allow for substrate access 36 . The 
RMSD of the DFG motif can also be correlated with the above results, 




Figure 2 | Superimposition and RMSD of the activation segment. (A) Activation segment superimposition of apo and ATP-bound structures. (B) 
Residue RMSD of the activation segments with respect to the initial structure. Apo unphosphorylated in red, apo phosphorylated in forest green, and 
phosphorylated with bound ATP in magenta color are given. These color codes are used for all the figures to represent apo and ATP-bound structures. 
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Figure 3 | Molecular interaction of T345. (A) Residue minimum distances of T345/R334, T345/R310, and T345/R347. (B) Apo and ATP-bound IRAK4 
KD is shown in ribbon model. Phosphorylation sites and interacting residues are shown in stick model. The hydrogen bond and salt bridge 
interactions are shown with blue dashes. The color codes are given in the Figure 2 legend. 



showing that the pIRAK4-ATP structure is highly stable after 
—35 ns compared with both pIRAK4 and uIRAK4 structures 
(Supplementary Information Figure S2A). Additionally, the RMSF 
of the activation segment shows that there is high residue flexibility in 
uIRAK4 in the T345 position compared with pIRAK4 and pIRAK4- 
ATP (Supplementary Information Figure S2B). The above results 
cumulatively suggest that even though the phosphorylation of the 
activation loop induces conformational changes in the DFG motif, 
activation loop phosphorylation and ATP binding are mandatory for 
the stabilization of the activation loop, and thus the whole activation 
segment is stabilized 33,45 . 

Stabilization of the basic residues (Arg triad) at the phosphoT345 
site. In general, the critical electrostatic interaction of a specific 
phosphoresidue in the activation loop with a positively charged 
(Arg) cluster leads to activation loop refold, and the loop itself is 
positioned for substrate binding. The active crystal structure of 
IRAK4 KD revealed that its activation loop phosphorylation is 
similar to that of other phosphorylated Ser/Thr/Tyr kinases 32 ' 34 . 
However, out of the three phosphorylation sites pT342, pT345, 
and pS346 in the activation loop, T345 is the primary 
phosphoresidue anchor toward the basic pocket that is conserved 
among kinases 50-51 . R310 precedes the catalytic base, and R334 and 
R347 in the activation loop are coordinated with the primary 
phosphoresidue pT345. Out of the three basic residues 
coordinating with the phosphoresidue pT345, the active crystal 
structure revealed that R334 has direct interaction with the 



phosphate group of T345. To check the consistency of the 
interactions between the phosphate group of pT345 and the basic 
pocket, we examined the MD trajectories of uIRAK4, pIRAK4, and 
pIRAK4- ATP. MD simulations of pIRAK4 and pIRK4- ATP revealed 
that R334 is coordinated with the phosphate group of pT345, with 
minimum residue distance around 2.5 A throughout the 
simulations. In contrast, on removal of the phosphate group in 
uIRAK4, we observed that the residue minimum distance of T345/ 
R334 consistently increases to —17 A (Figure 3A). Interestingly, we 
also noticed that the minimum distance of pT345/R310 during the 
simulations was within 2.5 A in pIRAK4-ATP, whereas this distance 
was up to 7 A and —8 A (till — 120 ns, distance raised up to 10 A in 
uIRAK4) in pIRAK4 and uIRAK4 structures, respectively 
(Figure 3A). In addition, the minimum distance of pT345/R347 
was 2-4 A in all three trajectories, with high fluctuation during 
simulations (Figure 3A). Firstly, the minimum distances of T345/ 
R310 and T345/R334 observed in all three simulations suggest that 
R310 from the catalytic loop and R334 form the activation loop play 
dominant roles in the anchoring of pT345, compared with R347 
upon ATP binding to the phosphorylated IRAK4 kinase. Secondly, 
there is an indication that interaction between the N lobe and C lobe 
loosens in inactive IRAK4 kinase through the disruption of the 
interaction of T345/R310 and T345/R334. The uncoupled T345/ 
R310 and T345/R334 fluctuations may reflect the intrinsic 
dynamics in IRAK4 kinase 32,34 . These results may also indicate that 
R310 and R334 complement each other upon natural ligand ATP 
binding. The minimum distance of T345/R310 in pIRAK4, that is, 
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7 A, and of 2.5 A for pIRAK4-ATP during simulation further 
indicates the role of R310 in the transition from the activation to 
the inactivation state by loss of interaction between the phosphate 
group of pT345 and the side chain of R310. It is to be noted that this 
type of interaction is important for most of the RD kinases 36,48 . 

Molecular interactions at phosphoT342 and S346 sites. To explore 
the importance of pT342 and pS346 upon activation of IRAK4 
kinase, we analyzed the interaction pattern of pT342 and pS346 
with their surrounding residues during simulations. The residue 
pT342 forms stable electrostatic interaction with T365, K367, and 
K441 in both pIRAK4 and pIRAK4-ATP structural models, which is 
consistent with the crystal coordinates. Interestingly, we observed 
that pS346 shows hydrogen bond interaction with Q228 from helix 
aC in the pIRAK4-ATP model (Figure 3B). Q228 from helix aC is 
quite distant from S346 in the uIRAK4 and pIRAK4 model, that is 
— 18 A and —15 A, respectively (Supplementary Information Figure 
S3). In the active crystal structure with AMP-PNP binding to IRAK4 
KD, pS346 is highly exposed to solvents without any intramolecular 
interaction. However, in our analysis, upon ATP and Mg 2+ ion 
binding to IRAK4 KD (pIRAK4-ATP), the side chain of the Q228 
residue reorients itself toward pS346, suggesting that, in addition to 
the Arg cluster, Q228 might also play an important role in switching 
the helix aC to the proper orientation for the catalytic residue E233. 
In most of the Ser/Thr kinases, the basic residues in the helix aC 
interact with phosphorylated residue in the activation loop 52 . In 
addition, upstream of pS346, the side chain of R347 also reorients 
itself toward pS346 upon phosphorylation and ATP binding. 
Mutation of this phosphorylation site reduces the catalytic activity 
of IRAK4 kinase up to 50% 33 , reflecting the importance of this 
phosphorylation site in the regulation mechanism of IRAK4 kinase 
and indicating that this phosphorylation site of IRAK4 KD warrants 
further investigation. 

Deviations from initial structure and residue flexibility analysis. 

RMSD values were used to investigate the structural difference 
between the apo and ATP -bound forms of IRAK4 KD. RMSD 
values of the backbone atoms for uIRAK4, pIRAK4, and pIRAK4- 
ATP are plotted as a function of time in Figure 4. The RMSD plot 
shows that all the three trajectories were able to reach reasonable 
equilibrium at —28 ns. As expected, the ATP-bound phosphorylated 
structure (pIRAK4-ATP) is more stable than the apo trajectories, 
indicating that ATP induces conformational changes in order to 
stabilize the whole structure, whereas both the apo trajectories 



uIRAK4 and pIRAK4 show slightly higher fluctuations, even after 
equilibration. In particular, uIRAK4 induces rapid deviation even 
after —60 ns. It is noteworthy that pIRAK4 shows slight stable 
deviation, indicating that phosphorylation of the activation loop 
stabilizes the activation segment, thereby altering the whole 
conformation of the structure followed by ATP binding. To 
identify the most flexible regions of apo and ATP-bound IRAK4 
KD, the RMSFs of the backbone atoms of each residue were 
calculated over the last 100 ns (100-200 ns) and are plotted in 
Figure 5. The flexible regions of both the apo trajectories are the G 
loop, the loop formed by the 2 nd and 3 rd (3-strand, the loop at the N 
terminus of the helix ocC, the N terminus of the hinge region in the N 
lobe, the aDE loop (276-282), the activation segment (329-358) 
(only for uIRAK4), the helix aEF, and the helix aG (395-406) 
(only for uIRAK4) and its adjoining loops (383-394 and 407-423) 
in the C lobe. Most of the Ser/Thr kinases show that these regions are 
involved in ATP and substrate binding 53 . In addition, the ATP- 
bound phosphorylated structure shows high fluctuations at the 
loop formed by the 2 nd and 3 rd )3-strand, the aDE loop, and 
the helix aG and its adjoining loops. Compared with other 
Ser/Thr/Tyr kinase structures, the most variable regions in the 
IRAK4 structure are the aDE loop, the activation segment, and 
the helix aG and its adjoining loops 32 . These regions show a high 
degree of residue fluctuations in uIRAK4 and pIRAK4-ATP, 
indicating that these regions may be closely involved in substrate 
binding as well as in the initial inactivation mechanism of IRAK4 
kinase. Notably, the high fluctuations observed in the loop regions of 
both the N and C lobes indicate the functional importance in IRAK4 
kinase activity. 

ATP binding site. To check whether phosphorylation alters the 
shape and size of the ATP binding pocket, we compared the 
IRAK4 structures of both the apo trajectories using the CASTp 
server and chimera tool. From this comparison, we observed that, 
in the case of phosphorylated structure, the ATP binding pocket was 
deeper and looked more accessible than in the unphosphorylated 
structure (Figure 6). The volumes of the ATP binding pockets in 
the phosphorylated and unphosphorylated structures were 1798 A 3 
(large pocket in pIRAK4) and 438.6 A 3 (large pocket in uIRAK4) 
respectively. In addition, total solvent accessible surface areas 
(SASA) of the pIRAK4 and uIRAK4 structures were 13073.5 A 2 
and 12932.7 A 2 , respectively. These results indicate that 
phosphorylation in the activation loop of IRAK4 KD alters the 
volume of the ATP binding pocket. Of the five )3-strands in the N 
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Figure 4 | RMSD of Apo and ATP-bound complex during simulation. RMSD of the backbone atoms with respect to initial structure of the apo and ATP- 
bound structures over 200-ns simulations. 
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Figure 5 | RMSF of Apo and ATP-bound complex during simulation. RMSF plot of the backbone fluctuations obtained for apo and ATP -bound 
structures over last 100 ns (100 to 200 ns) simulations. 

lobe, the first two (5-strands shift slightly to the left; in particular, the phosphorylated IRAK4 KD undergoes conformational changes to 
G loop formed by the first two (3-strands moves a little away to access enlarge the surface that is accessible for ATP binding. In addition, 
the ATP molecule and thereby forms a deeper ATP binding pocket upon unphosphorylation (inactivation) of IRAK4 kinase, the N and 
upon phosphorylation (Figure 6). This result suggests that the fully C lobes move against each other via a hinge motion. 




Figure 6 | Structural difference in the ATP binding pocket. Surface and ribbon representation (top) of unphosphorylated and phosphorylated apo 
structures with ATP binding pocket (magenta rectangle). Structural differences in the ATP binding pocket are shown by superimposing both the apo 
structures (bottom). Color codes are given in the Figure 2 legend. 
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Time (ns) 

Figure 7 | Helix aC orientation. (A) The structural representation of the key residues and their interactions are shown to understand the helix aC 
orientation. Atoms are shown in stick representation. (B) The minimum distance is shown between the NiJ of K213 and CS of E233. (C) Structural 
superimposition of the helix aC from apo and ATP-bound structures. Color codes are given in the Figure 2 legend. 



Helix aC orientation. It has been consistently observed that helix aC 
in the (3-sheet- rich N lobe is an important mediator for the regulation 
mechanism of kinase, as this helix comprises a Glu residue that is 
crucial for the salt bridge interaction with the conserved Lys residue 
from the [33 strand. This conserved Lys residue in turn coordinates 
with the a- and [3-phosphate groups of ATP 54 . Helix aC often makes 
direct contact with the DFG motif, which is toward the N-terminal of 
the activation segment. Its role in kinase regulation is indicated by its 
proximity in the active state to the activation loop 54 . In both apo and 
ATP-bound IRAK4 structures, the helix aC was found to have stable 
deviation from the initial structure during simulations 
(Supplementary Information Figure S4A). The IRAK4 active 
crystal structure shows two different conformations, which are 
opposite to those of the other Ser/Thr kinase, i.e., helix ocC-in 
contains no salt bridge, but helix aC-out contains a salt bridge 
interaction between K213 from the (33 strand and E233 from helix 
aC 3439,40 . Hence, the distance between the amino group of K213 and 
the carboxylate group of E233 was monitored throughout the 
simulations for both the apo and ATP-bound trajectories. 
Interestingly, the distance showed a remarkable result in terms of 
ATP binding, as well as the helix aC. In the case of pIRAK4- ATP, the 
distance was 3.84 A, whereas in both apo trajectories the distance 
was 5.05 A for pIRAK4 and 3.47 A for uIRAK4. Notably, in the 
pIRAK4-ATP structure, the distance was —5.5 A throughout the 
simulations up to 190 ns (Figures 7 A and B). This result suggests 
that binding of ATP leads to further new interactions i.e., the 
negatively charged E233 coordinates with the positively charged 
Mg 2+ ion. In contrast, the positively charged K213 interacts with 
the negatively charged phosphates of ATP. In addition, we 
observed only helix ocC-in conformation in all three trajectories, 
since the distances mentioned above are too long to be considered 
salt bridges. This may be because the initial structure used for MD 



simulation was in the helix aC-in conformation. However, it is worth 
mentioning that the salt bridge distance between K213 and E233 is 
closer in uIRAK4 compared with pIRAK4. 

Further, we were interested in the role of the gatekeeper residue 
Y262 in the phosphorylation of the kinase. Y262 hydrogen bonds 
with E233 in unphosphorylated IRAK4 and the ATP-bound phos- 
phorylated IRAK4 model. In contrast, the distance was about 3.59 A 
in phosphorylated IRAK4 model (Figure 7A). In pIRAK4, the residue 
distance was initially within 2.5 A. However, after —25 ns, the dis- 
tance increased to —4.5 A (Supplementary Information Figure S4B). 
Therefore, the structural rearrangement occurs in the helix aC with 
slight rotation and bending of the helix. However, this cannot be 
considered as the helix txC-out conformation as we did not observe 
a K213-E233 salt bridge in the pIRAK4 structure. In addition, the 
RMSD of the superimposed helix aC from both apo and ATP-bound 
models were 2.25 A for uIRAK4 and pIRAK4, 2.30 A for uIRAK4 
and pIRAK4-ATP, and 2.04 A for pIRAK4 and pIRAK4-ATP 
(Figure 7C). These RMSD values, in particular 2.25 A for uIRAK4 
and pIRAK4 and 2.04 A for pIRAK4 and pIRAK4- ATP, suggest that 
both the helix aC-in and helix aC-out conformations are possible 
upon activation loop phosphorylation. In several kinases, the hinge 
movement of helix txC away from the ATP binding site is required to 
switch from the catalytically active to the inactive state 55 . The above 
results suggest that unphosphorylated IRAK4 KD (inactive KD) 
exists in helix aC-in conformation; once unphosphorylated IRAK4 
becomes phosphorylated it may exist in both helix txC-in and -out 
conformations (or more), and finally when ATP binds between N 
and C terminal lobes, may shift this equilibrium to the helix ocC-in 
position. In addition, E233 forms a hydrogen bond with the back- 
bone amine of F330 in all three forms (Figure 7A). However, the 
residue distance between E233 and F330 in uIRAK4 is higher com- 
pared with pIRAK4 and pIRAK4-ATP (Supplementary Information 
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Figure 8 | DFG motif conformation. Structural representation of DFG motif from apo and ATP-bound structures are shown. Atoms are shown in stick 
representation. The distances between NH of G331 and oxygen of D329 are shown in blue dashes. The superimposition of DFG motif form apo and ATP 
bound structures are shown in the right bottom corner. Color codes are given in the Figure 2 legend. 



Figure S4B). Hence, we speculate F330 may also be indirectly 
involved in the switching mechanism of helix aC. 

DFG and HRD motifs. Aspartic acids from DFG (329-331) and 
HRD (309-311) motifs are two critical residues responsible for the 
catalytic mechanism of many kinases. The structure-function 
relationship of the kinases, particularly the conformational changes 
in the DFG motif (i.e., the correlation of DFG flip with enzyme 
activity), has been widely studied. However, the exact mechanism 
of the DFG flip remains uncertain 52,56,57 . In active kinases, the DFG 
motif adopts DFG-in conformation as the Asp coordinates the 
magnesium ion, which in turn chelates ATP phosphates; this 
region of the kinase is called the magnesium-binding loop. The 
HRD motif acts as a base catalyst that deprotonates the Ser/Thr of 
the substrate side chain, and this region may be considered as a 
catalytic loop. Therefore, we were interested in understanding the 
DFG and HRD position in both apo and ATP-bound IRAK4 KD 58 . 

In our study, we observed two different DFG conformations for 
both ATP bound and apo IRAK4 KD. In the case of the ATP-bound 
phosphorylated structure, D329 coordinated with the Mg 2+ ion, 
thereby adopting DFG-in conformation, whereas in the apo struc- 
tures D329 does not fully orient its side chain to coordinate with the 
Mg 2 * ion (Figure 8). The side chain position of the phenylalanine in 
the ATP-bound form points toward the ATP binding site. In contrast, 
in both the apo structures uIRAK4 and pIRAK4, the side chain of 



F330 points toward the helix ocC and helix aE respectively. Therefore, 
we speculate that F330 may be involved in hydrophobic interactions 
of hydrophobic residues involving ATP binding. Kornev et a/ 56 sug- 
gested that the conformation of Asp in the active kinase relies on the 
hydrogen bond between the N-H group of Gly and the oxygen of Asp 
and that disruption of this hydrogen bond resulted in the possibility of 
DFG-flip. However, the Gly role in the DFG motif is not clearly 
defined. To test the above hypothesis for IRAK4 kinase, we measured 
the distance from the oxygen of D329 to the hydrogen of G331 from 
the three models: the distances in uIRAK4, pIRAK4, and pIRAK4- 
ATP were 6.27 A, 6.30 A, and 5.91 A, respectively (Figure 8). Even 
though these distances cannot be considered as hydrogen bonding, 
these measurements may still be a reflection of the interaction, par- 
ticularly upon ATP binding, to stabilize the position of aspartic acid 45 . 

The other conserved motif in the kinase is the HRD motif (triad). 
In order to understand the active conformation of IRAK4, we inves- 
tigated the HRD motif. In general, it is known that in Ser/Thr kinases, 
Arg from HRD interacts with the phosphate group from the activa- 
tion loop 35,36 . In the case of the IRAK4 active crystal structure, R334 is 
the only residue that interacts with the phosphate group of T345 in 
the activation loop. However, during simulation, R310 maintained 
salt bridge interaction with pT345 in the ATP-bound phosphory- 
lated IRAK4 structure. In contrast, this interaction was not observed 
in both the apo trajectories (Figure 3B). The minimum distances 
between T345 and R310 are shown in Figure 3 A. In the case of 
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Figure 9 | Projection of IRAK4 KD trajectories in phase space. (A) The first 1 5 eigenvectors of the covariance matrix and the lines with cross symbol 
represent the cumulative sum of the contribution to the total fluctuations. The clouds represent the projection of the trajectories of IRAK4 KD (B) 
eigenvector 1 and 2 (C), eigenvector 2 and 3 (D) eigenvector 10 and 15. We performed the analysis with 50-ps coordinates of the 200-ns trajectory; thus, 
we used 4000 frames of the trajectory for the projection. The color codes for the clouds are given in the Figure 2 legend. 



uIRAK4 and pIRAK4, the distance between R310 and T345 is too 
long. Additionally, we analyzed the position of invariant D311, in 
both apo and ATP-bound structures and the distance between D31 1 
and nearby K313 was around 2-4 A (data not shown). These dis- 
tances suggest that D311 and K313 may form a salt bridge 
(Supplementary Information Figure S5). Additional interaction of 
D311 with T351 from the substrate-binding loop was observed in 
both apo structures (Supplementary Information Figure S5). D311 
and K313 are greatly conserved in Ser/Thr kinases and the charged 
side chain groups of these residues are involved in the electrostatic 
stabilization of the transition state 59 . Our results strongly support the 
role of D311, which may trigger the hydroxyl group of the substrate, 
in the catalytic mechanism of IRAK4 kinase. 

Collective modes of motion in apo and ATP-bound IRAK4 KD. 

To better understand the conformational changes of the apo and 
ATP-bound IRAK4 KD structures, the MD trajectories of each 
system were inspected with the principal components. Approxi- 
mately, the first 15 modes contribute greatly to the collective 
motions, as only the first 15 eigenvectors showed values greater 
than 1 nm 2 . The first 15 collective modes for each system with 
cumulative percentage are shown in Figure 9A. The application of 
PCA components for uIRAK4, pIRAK4, and pIRAK4-ATP 
structures and the motions of simulations showed 68%, 49%, and 
54%, respectively, and accounted for the first 15 eigenvectors 
(Figure 9A). From these results, we observed that the internal 
motion of both apo and ATP-bound IRAK4 KD is limited to 
subspace with fewer dimensions. The projection of IRAK4 KD, 
defined by eigenvectors 1 and 2, 2 and 3, 10 and 15 from each MD 



trajectory, allows visualization onto the essential space (Figures 9B, 
C, and D). In particular, the projection of MD trajectories onto 
eigenvectors 1 and 2 showed that all the three trajectories were 
sufficient to reach distinct minima with small energy barriers. 
Distribution of all three systems showed that both apo and ATP- 
bound IRAK4 undergo conformational changes. In particular, 
uIRAK4 projection, which indicates the mechanism of inactivation 
of IRAK4 kinase, undergoes large conformational changes. The 
pIRAK4 model shows dissimilar eigenvectors compared to the 
uIRAK4 and pIRAK4-ATP trajectories. However, all the three 
trajectories showed extensive sampling of phase space until the 
system reached a different equilibrium on the energy landscape. 
Figures 9C and D show that the motions of the protein attain their 
equilibrium fluctuations within 15 eigenvectors. Thus, the three well- 
defined clusters seen in Figures 9B, 9C, and 9D indicate the 
conformational changes that may occur upon phosphorylation and 
ATP binding to the unphosphorylated IRAK4 kinase as well as upon 
inactivation of IRAK4 kinase. 

The subdomain motions within IRAK4 were analyzed by the cor- 
responding motion mode of the first eigenvector, as seen in Figure 10. 
The animation of motions for 200-ns MD trajectories of the three 
systems can be seen in the movies SI -S3 for eigenvector 1 and S4-S6 
for eigenvector 2. As expected, large domain motions were observed 
for the apo trajectories compared with those for the ATP-bound 
form. In unphosphorylated IRAK4 kinase, overall, the N and C lobes 
move toward each other around the hinge region (Figure 10, 
Supplementary Information Movie SI). The hinge region connects 
the N and C lobes and interacts with the edge of the ATP molecule. 
Therefore, the N and C lobes independently contribute to the surface 
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of the ATP binding site, in particular the activation loop in the 
activation segment moves toward the C-lobe helices and helix aG 
moves toward the activation segment. The second dominant motion 
of uIRAK4 is in the opposite direction to the C lobe, including the 
activation segment with the N lobe (Supplementary Information 
Movie S4). The N lobe and helix ocG, and its adjoining loops, show 
more pronounced dynamics in unphosphorylated IRAK4 KD, sug- 
gesting that these regions may play a role in the inactivation 
dynamics of IRAK4 kinase. 

In the phosphorylated IRAK4 trajectory, the first dominant mode 
can be described as the breathing mode, in which helix aC and the G 
loop from the N lobe move against the C lobe, particularly against the 
activation segment and helix aG and its adjoining loop (Figure 10, 
Supplementary Information Movie S2). Strikingly, the activation 
segment showed less pronounced motion, which is consistent with 
the PvMSF calculations. Similar motions were found in other 
kinases 53,60 . The most significant motion of the second dominant 
mode in pIRAK4 was the N lobe moving against helix aG and its 
adjoining loops and activation segment (Supplementary Information 
Movie S5). The first dominant mode in ATP-bound IRAK4 involved 
helix aG and its adjoining loops, and the aDE loop from the C lobe 
moved against the activation segment, in particular substrate binding 
site, while the N lobe remained stable (Figure 10, Supplementary 
Information Movie S3). The motion of the second dominant mode 
in pIRAK4-ATP involved helix aG and its adjoining loop, and the 
aDE loop moved in the opposite direction to the activation segment 
(Supplementary Information Movie S6). The dominant motions of 
helix aG and the aDE loop upon ATP binding to the phosphorylated 
IRAK4 suggested the importance of helix aG and the aDE loop 
during the phosphorylation transition state catalyzed by the 
IRAK4 kinase. 

Discussion 

Protein kinases have been characterized as one of the essential com- 
ponents in the cell signaling pathways of all eukaryotic organisms 61,62 . 
Several studies have indicated that IRAK4 may be a suitable thera- 
peutic drug target for inhibiting the production of proinflammatory 
genes via the TLR/IL-1R signaling pathways 6,21,28,63 . In this study, 
comparative molecular dynamics simulations were performed with 
apo and ATP-bound IRAK4 KD structures. At the 200-ns timescale 
of IRAK4 KDs, the activation segment of the unphosphorylated 
IRAK4 (uIRAK4) structure showed a high degree of diversity com- 
pared to that of the phosphorylated IRAK4 kinase (pIRAK4 and 
pIRAK4-ATP) (Figure 2A). In the uIRAK4 model, the activation 
loop appears to be located in close proximity to the C lobe, which 
may allow closing of the substrate docking site, indicating the inact- 
ive nature of the kinase. However, in active IRAK4 kinase, the Arg 
triad may pull the whole activation loop, thereby exposing the p + 1 
loop for the IRAKI substrate. A wide variety of inactive conforma- 
tions of the activation segment have previously been observed, and in 
some cases the activation segment appeared to be quite disordered 48 . 
Crystal coordinates have shown that active IRAK4 KD activation 



loop phosphorylation is similar to that of other Ser/Thr/Tyr kinases 32 
such as ERK2 (MAP kinase), cAPK, CDK2, LCK, and IRK. Although 
IRAK4 kinase is a Ser/Thr kinase, the phosphate interaction pattern 
of active IRAK4 kinase is closely related to that of Tyr kinases such as 
LCK, IRK, IGF1RK, and JAK3, i.e., single primary phosphate ligand 
interaction with Arg or Lys, equivalent to Arg334 of IRAK4 kinase 32 . 
Interestingly, the pIRAK4-ATP model showed direct interaction of 
pT345 with R310 and R334 throughout the simulations, indicating 
that, upon ATP binding to IRAK4 kinase, R310 and R334 are mutu- 
ally complementary (Figure 3A). Most Ser/Thr kinases have been 
observed to possess two or three phosphate-binding residues, and 
in most RD kinases, the arginine from the HRD (YRD in PKA) motif 
interacts with the primary phosphorylation site upon activa- 
tion 36,48,62 . However, in pIRAK4, pT345 showed direct interaction 
with R334 alone, whereas in uIRAK4 T345/R334 was quite distant, 
indicating that the lobe separation in the inactivation mechanism of 
IRAK4 kinase is facilitated through disruption of the interaction of 
pT345 with R310 and R334 (Figure 3A). Upon ATP binding, hydro- 
gen bond interaction was observed between the phosphate group of 
pS346 and Q228 (polar residue) from helix aC, whereas in apo 
IRAK4 these residues are relatively distant (Figure 3B) 
(Supplementary Information Figure S3). Mutation of this residue 
reduces IRAK4 kinase activity up to 50% 33 . This type of interaction 
has previously been observed in most of the Ser/Thr kinases such as 
active MAPK kinase (R68-pT183), in which the PKB (H196-pT309) 
basic residue from helix aC interacts with the phosphorylated res- 
idue from the activation loop, whereas Tyr kinases lack such inter- 
actions 52 . This analysis reveals that, upon activation of IRAK4 kinase, 
apart from the Arg triad, the Q228 from helix aC may also be 
involved in switching the helix aC back to close proximity to the 
DFG motif at the N-terminus of the activation segment, thereby 
facilitating the proper orientation for the DFG flip. 

The ATP binding site is located between two lobes of the kinase 
domain. ATP binding volumes varied upon phosphorylation and 
unphosphorylation of IRAK4 KD, suggesting that upon inactivation 
of IRAK4 kinase, the motion of the hinge region may allow closure of 
the N and C lobes. In PKA, the relative orientation of the N and C 
lobes varies from open in the inactive conformation to closed in the 
active conformation via hinge motion, enabling tight binding of ATP 
molecule 61 . The helix aC in the N lobe of the kinase is a unique and 
dynamic regulatory element. In active Ser/Thr kinase, an electro- 
static interaction was observed between Lys from the (33 strand 
and Glu from helix aC, indicating the helix aC-in conformation, 
whereas the inactive kinase maintains the helix aC-out conformation 
(i.e., no salt bridge interaction is formed between Lys and Glu). The 
opposite is observed in IRAK4 kinase; in both the apo and ATP- 
bound models, the helix aC-in conformation was maintained, as no 
salt bridge interaction was formed between the conserved K213 and 
E233 residues (throughout in the simulations the residue distance 
remained —3.5-5.5 A) (Figure 7). The superimposed RMSD differ- 
ences in the helix aC suggest that it may appear in two different 
conformations in solution upon phosphorylation of the IRAK4 
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kinase (helix aC-in and -out). In Src-family tyrosine kinases, the 
intramolecular domains SH3 and SH2 are involved in the regulatory 
mechanism 64,65 , and it may be worth noting that the helix aC switch 
in IRAK4 kinase may be triggered by the death domain as well as the 
undetermined domain (via intramolecular interactions) of the 
IRAK4 kinase, or by an adaptor protein such as MyD88 (via inter- 
molecular interactions). The "gatekeeper" residue plays an import- 
ant role in the activation of the kinase, and although 77% of human 
kinases have large gatekeeper residues (Leu, Met, and Phe), 21% are 
tyrosine kinases, with small residues (Thr, Val). IRAK4 has a Tyr 
residue as a gatekeeper; comparison with the 400 kinases has shown 
that this Tyr is unique to the IRAK family 32 , as well as the plant 
receptor-like kinases (RLKs) 66 . Recently, it has been reported that 
kinases belonging to the IRAK family are very similar in structure 
and function to plant RLKs such as LYK3 67 . In IRAK4 KD, the 
hydrogen bond was maintained between Y262 and E233 in both 
the inactive and active models (Figure 7A), suggesting that the back 
pocket of the ATP-binding site may be completely inaccessible to 
ATP-competitive inhibitors in both active and inactive IRAK4 
kinases. However, mutation of the gatekeeper residue may have sub- 
stantial physiological significance. Mutation of the gatekeeper res- 
idue has been shown to be associated with drug resistance in chronic 
myeloid leukemia and several solid tumor patients 68 . Because the 
back pocket of the ATP-binding site is blocked in IRAK4 kinase, 
the extended front pocket of the ATP-binding area may have a direct 
impact on inhibitor design, compared to other protein kinases 32 . This 
may be true for all members of the IRAK family, as all have Tyr as the 
gatekeeper residue. 

IRAK4 kinase showed a DFG-in conformation when ATP was 
present and a DFG-out conformation for apo IRAK4 (Figure 8). 
The spatial arrangement of the HRD catalytic motif revealed the 
importance of R310 and D311 upon ATP binding and confirmed 
the active state of the enzyme (Supplementary Information Figure 
S5). The major structural differences in active IRAK4 KD are the loop 
between helices ocD and ocE (276-282) and helix aG and its adjoining 
loops (383-394 and 407-423). The position of helix aG is at a con- 
siderable distance from the active site and helix ocEF, with an approxi- 
mately 10-A shift compared to other Ser/Thr/Tyr kinases 6 . In 
addition, although the N-terminal extension of IRAK4 associates 
with the core structure of the kinase, no sequence similarity has been 
observed between the N-terminal extensions of various kinases 32,69 . 
Using PCA analysis, we observed the collective motions of three 
forms of IRAK4 KD. In the pIRAK4-ATP structure, pronounced 
dynamics were observed in the C lobe and in the loop regions of 
the N lobe. More strikingly, the dominant motions for the aDE loop 
and helix aG with its adjoining loops were observed upon activation 
(pIRAK4-ATP) and inactivation (uIRAK4). In particular, the dom- 
inant motion of helix aG in active (pIRAK4-ATP) IRAK4 kinase 
shows that helix aG and its adjoining loop play a crucial role in 
adapting the substrate for the catalytic mechanism upon activation 
(Figure 10). The dominant fluctuations observed for helix aG in the 
uIRAK4 model also suggest that helix aG plays a crucial role in the 
inactivation mechanism of IRAK4 kinase and that the loop regions 
within the IRAK4 kinase play a major role in the inter dynamics of 
IRAK4 kinase. 

We strongly believe that this type of computational approach is 
necessary to understand the structure-function relationship in pro- 
tein kinases. To our knowledge, this is the first study exploring the 
dynamic nature of both apo and ATP-bound IRAK4 kinase, which 
may provide some new insights for further research and drug 
development. 

Methods 

Apo and ATP-bound IRAK4 KD preparation. Crystal coordinates of active IRAK4 
KD in complex with inhibitor benzimidazole were downloaded from the Protein Data 
Bank (PDB) (PDB entry 2NRU). There is a crystal structure available for active 
IRAK4 KD in complex with p,*/-imidoadenosine 5 '-triphosphate lithium salt hydrate 



(AMP-PNP) (PDB entry 20ID). However, this structure has several missing residues, 
and the superimposition of both 2NRU and 20ID showed a root mean square 
deviation (RMSD) of 0.5 A. Hence, we used the 2NRU structure for further studies. 
First, we removed all the ligand and water molecules from the 2NRU crystal structure. 
In the 2NRU structure, only the residues T345 and S346 at the activation loop are 
phosphorylated; to construct fully phosphorylated IRAK4 KD, we additionally 
phosphorylated T342 in the activation loop using Discovery studio visualizer 
(Discovery Studio 2.1, Accelrys Inc., San Diego, CA, USA), as these three 
autophosphorylation sites are responsible for IRAK4 kinase activity 33 . To build the 
complex with ATP (pIRAK4-ATP), we superimposed the fully phosphorylated 
structure on to the crystal structure of 20ID, retrieved the AMP-PNP molecule from 
20ID, and modified the AMP-PNP ligand as the ATP molecule. Additionally, we 
included two magnesium ions (Mg 2+ ) based on the crystal structure of phosphorylase 
kinase (PDB entry 1PHK). We have included two magnesium ions because IRAK4 
kinase needs more than one Mg 2+ ion for its catalytic mechanism 70,71 . To construct 
pIRAK4, we removed the ATP and Mg 2+ ions from the complex structure pIRAK4- 
ATP. Although the phosphorylation mechanism of the kinase has substantial 
physiological importance 72 , no crystal coordinates are currently available for inactive 
IRAK4 KD, and it is not yet computationally possible to model the conformation of 
the inactive kinase with physiological significance. Hence, we constructed an 
unphosphorylated IRAK4 (uIRAK4) KD to represent inactive IRAK4 KD. To 
construct uIRAK4, we replaced the phosphothreonine and phosphoserine residues at 
T342, T345, and S346 with Thr and Ser from pIRAK4. A similar approach has been 
reported previously 45,53 . Eventually, we subjected the three different systems 
(uIRAK4, pIRAK4, and pIRAK4-ATP) to further dynamics simulations. 

Molecular dynamics (MD) simulations. Classical MD simulation was performed for 
UIRAK4, pIRAK4, and pIRAK4-ATP using the Gromacs 4.6 73,74 package. The 
Gromos43al force field and PRODRG2.5 75 server were used to generate the 
topologies for protein and ATP, respectively. The uIRAK4, pIRAKf, and pIRAK4- 
ATP systems were solvated in a corresponding box under the periodic boundary 
conditions using a distance of 1.0 nm from the protein to the surface of the box. To 
neutralize the system, counter ions were added accordingly. Each system was 
subjected to energy minimization using the steepest descent integrator without 
constraints for 1000 steps. Following the minimization step, systems were 
equilibrated under NVT (canonical ensemble) and NPT (isothermal-isobaric 
ensemble) conditions for 100 ps at 300 K after applying position restraints to the 
protein. Finally, a 200-ns production run was performed under NPT conditions by 
removing position restraints. The temperature and pressure of the system were 
maintained using the Berendsen weak-coupling method. Lennard-Jones potentials 
were used for Vander walls interactions, and electrostatic interactions were handled 
by PME calculations with a cut-off for the real space term of 0.9 nm 76 . The LINCS 
algorithm was used to constrain all the bonds 77 . A 2-fs time step was applied and 2-ps 
final coordinates were saved. Most of the analyses were performed using Gromacs 
inbuilt tools with 50-ps coordinate. The final low energy conformation structures 
were extracted from last 10 ns (190-200 ns) of each simulation, and this structure has 
been considered as a representative structure whenever necessary to compare the 
three different systems. The RMSD and root mean square fluctuation (RMSF) 
calculations were done by least square fit. All visualizations were performed using 
Chimera 78 , VMD 79 , and PyMOL (The PyMOL Molecular Graphics System, Version 
1.5.0.4 Schrodinger, LLC). All the graphs were plotted using Microsoft Excel. 

Principal component analysis. To obtain dominant and collective modes of the 
protein from the overall dynamics of the MD trajectory, PCA was performed. PCA is 
based on the construction of a mass-weighted covariance matrix of the atom 
displacement, and this covariance matrix is diagonalized to extract a set of 
eigenvectors and eigenvalues that reflect concerted motion of the molecule 80-82 . 
Eigenvectors represent the direction of motion, whereas the corresponding 
eigenvalues represent the amplitudes in those directions. The IRAK4 coordinates 
were extracted from each time frame of 50-ps MD trajectories from each system. 
Overall rotational and translational movements in the MD trajectory were removed 
by a least-squares fit superimposition on the average structure 83 . The Gromacs inbuilt 
tool g_covar was used to yield the eigenvalues and eigenvectors by calculating and 
diagonalizing the covariance matrix, whereas the g_anaeig tool was used to analyze 
and plot the eigenvectors. 
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